m 



M1T/LCS/TR-268 



DERIVED PAIRS, OVERLAP CLOSURES, AND REWRITE DOMINOES: 
NEW TOOLS FOR ANALYZING TERM REWRITING SYSTEMS 



John V. Guttag 
Deepak Kapur 
David R. Musser 



This blank page was inserted to preserve pagination. 



■*8jgr~ ■:■'■■■■**>■-■•■ ■■**■■ *• -- ."«-•. .^^safe-"-. .- v l. ~ . '&*^%h|S^«:»^ 



DERIVED PAIRS. OVERLAP CLOSURES, AMD REWRITE DOMINOES: 
NEW TOOLS FOR ANALYZING TERM REWRITING SYSTEMS 

J oka V. Out tag 
MIT Laboratory for Coaputer Scieace 

Deepak Kapur 
Oanaral Elactrie Researek and Developaent Center 

David R. Musser 
General Elaetric Reeearek and Development Caatar 

Abstract 
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aatioa. We also iatroduea a aew repreaentation, based *a rewrite doainoes, for 
rewrite rules aad se queue es of rewrites. 
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1. INTRODUCTION 

We introduce three new tools for the study of tern rewriting systems. 
Derived pairs of a rewrite rule generalixe the well-known idea of "critical 
pairs" introduced by Knuth and Bendiz (1970) in their development of a method of 
proving the confluence property. The overlap closure of a set of rules is a set 
of rules that corresponds to a subset of the transitive closure of the rewriting 
relation. Its construction is based on the use of derived pairs obtained from 
superpositions of the right hand side of one rule with the left hand side of 
another. This process is closely related to the Knuth-Bendix process, which uses 
critical pairs for generating new rules in an attempt to achieve confluence. We 
use the overlap closure in proving - or disproving - that a rewriting relation 
is uniformly terminating. It thus provides an interesting dual method to the 
Knuth-Bendix process, in which the validity of the critioal pair test for con- 
fluence depends upon uniform termination. The combination of uniform termina- 
tion and confluence provides a decision procedure for the theory of the equa- 
tions corresponding to the original rules. 

In the study of derived pairs and overlap closures we found it useful to 
devise a new way of representing rewrite rules and sequences of rewrites using 
what we call rewrite dominoes and "rewrite domino layouts". We will introduce 
this representation and use it in presenting the proofs of our main results 
about the overlap closure. We believe that this representation also will be 
useful in the study of other areas of rewrite rule theory. 

Like the Knuth-Bendix process, the overlap closure process may fail to ter- 
minate (that is, it may oontinue to generate new rules indefinitely). In fact, 
when the original rules are uniformly terminating, it will usually happen that 
overlap closure generation is nonterminating. In this case, the overlap closure 
process does not by itself yield a proof of uniform termination, but it may be 
useful as an aid in applying other known methods of proving uniform termination 
[see Huet and Oppen, 1980] . It can also be used in proving what we call "res- 
tricted termination," i.e., termination for all terms up to a given size. Some 
applications of restricted termination are discussed in [Out tag, Kapur and 
Musser, 1981]. 

Perhaps more important is the case where the original rules are not uni- 
formly terminating. One would often like to be able to detect this situation 
quickly, e.g., in order to avoid wasting time attempting to construct a proof of 
uniform termination. We show that under some reasonable restrictions on the 
form of rewrite rules, the overlap closure construction provides such a test. 



1. more commonly called finitely terminating or noetherian. 
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I.e., we show that if the rules are globally finite (that is to say, the number 
of different terms to which any term can be rewritten is finite) and every rule 
is right-linear or every rule is left-linear, the overlap closure construction 
can be used to effectively search for cycles in the rewriting relation. (That 
it does so "quickly" enough to be useful is a claim for which we have limited 
empirical evidence, as discussed in the Conclusion section). 
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2. DEFINITION OF OVERLAP CLOSUBE 

For the most part we use standard definitions and terminology for tern 
rewriting systems frost Huet (1980) and Hmet and Oppen (1980). There are a few 
exceptions, snoh as "uniform termination" for "finite termination," and "termi- 
nal form" for "normal form." In [Gut tag, Eapur, and Husaer, 1981], the reader 
will find a thorough discussion of this background material. Here we oonfine 
ourselves mainly to the definitions of "derived pairs," a generalixation of the 
Knuth and Bendix's notion of "critical pairs," and of "overlap closure." 

Two terms are said to overlap if one is uuifiable with a nonvariable sub- 
term of the other. If s and t overlap, we define their superposition ; either 

a) s unifies with a nonvariable sub term t' of t, by the most general unif- 
ier (m.g.u.) 0, in which case 0(t) is called a superposition of s and t; or 

b) a nonvariable subterm s' of s unifies with t, by m.g.u. 0, in which case 
0(s) is called a superposition of s and t. 

Now consider ordered pairs of terms (r,s) and (t,u) such that s and t over- 
lap, as above. (If the variables of t must be renamed, the same renaming must 
be applied to u.) Then along with the superposition 8(t) or 0(s) we obtain the 
derived pair of terms, <p,q>, where 

a) if s unifies with a nonvariable subterm t/i by m.g.u. 0, 

p»[9(t) with 0(r) at i] 1 

q-O(u) ; 

b) if a nonvariable subterm s/i unifies with t by m.g.u. 0, 

p=6(r) 

q-[9(s) with 0(u) at i]. 

In the case of a rewriting system R ■ {(1. -» r.)), the derived pairs obtained 
from the pairs (^.1^ »»«" (1 .ar^)) are called cr^ical pairs. 



1. The notation [t with u at i] stands for the term obtained 
from t by replacing the subterm at position i by u. A "sub- 
term position" and "corresponding subterm" within a ton is a 
finite sequence of nonnegative integers separated by "." and 
a related term determined as follows: to the null sequence 
(denoted <>) corresponds the entire term. If f(tj,...,t ) is 
the subterm at position i then the subterm at position i.j is 
We write t/i for the subterm at position i within term 



V 

t: 
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Consider, for example, obtaining a critical pair from the rewrite rules: 

-1 _. 
x • x— »e 

(x' • y') • z'-»x' • <y' • z') 

We begin by constructing the ordered pairs (e, x ex) and 

((x' • y') • z', x' • (y' • z')). Now x~ • x can be unified with x' • y' using 

-1 
the substitution 9 » [x /x', x/y']. This leads to the derived pair 

<e • z'.x • (x • z')> which is a critical pair of the rules. 

Using derived pairs, the overlap closure o£ 1, written OC(R), is defined 
inductively as follows: 

a. Every rule r -» s in R is also in OC(R). 

b. Whenever r -> s and t -» u are in OC(R), every derived pair <p,q> of (r,s) 
and (t,u) is in OC(R) (as p -» q) . 

c. No other rules are in OC(R). 

"Examples of overlap closures:" 

i. Let R - {f(x) -> g(x)}, then OC(R) - R. 
ii. Let R » {f(x) -> g(h(x)), h(x) -> k(x)}, then OC(D - R 

U {f(x) ^ g(k(x))}. 
iii. Let R - {x • (y • z) -» (x • y) • z} , then from the superposition 
(x • (x' • y')) • z' we obtain the rule 

x • <(x' • y») • z') -^ ((x • x') e y») • z' 
and from the superposition (x • ( (x' • y ' ) • z' ) we obtain 



x • (x' • (y' • z')) -> (x • (x' • y»)) • z' . 

These rules then lead to further rules, and 0C(£) is infinite, 
iv. Let R - {f(x) -> g(x), g(h(x)) -> f(h(x))}. Then OC(g) consists of R and 
the reflexive rules f(h(x)) -> f(h(x)) and g(h(x)) -* g(h(x)). 

The overlap closure OC(R) has a rich structure since the overlap closure 
construction preserves some properties of a rewriting system R. The following 
theorem shows that every derived pair of two rewrite rules is also a rewrite 
rule, implying that the overlap closure OC(g) is a rewriting system. 
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2.1. Theorem . If r,s,t,u are terms such that (r,s) and (t,u) are rewrite 
rules, then every derived pair <p,q> of (r,s) and (t,u) is also a rewrite rule. 

Proof . One just has to verify that for each ease in the definition of derived 
pair that every variable that occurs in q occurs also in p. D 

Let us consider some other properties, based on the properties of its 
rules, of a rewriting system R 

A term is said to be linear if no variable occurs in it more than once. A 
rewrite rule is left-linear if its left term is linear, riaht- linear if its 
right term is linear, and linear if its left and right terms are linear. 

A rewriting system is called left-linear, right-linear, or linear, based on 
whether each of its rules is left-linear, right-linear, or linear, respectively. 
The following theorem implies that the overlap closure 0C(£) of a right-linear 
(left-linear, linear) R is also right-linear (left-linear, linear). 

2.2. Theorem . If r->s and t-»u are two right linear rules with disjoint vari- 
able sets, then each of their derived pairs, <p, q> is also right linear. 

Proof . There are three cases: 

(i) s unifies with the sub term t/i Of t by their m.g.u. 0. 

The corresponding derived pair <p, q> has 

p - [0(t) with G(r) at i] 



g - 6(u) 

Since s is linear, by Lemma 1 in the Appendix, substitutions for any two dis- 
tinct variables in t/i in 9 do not have a common variable. The variables in t 
other than the ones in t/i do not play any role. So 0(u) is linear. 

(ii) the subterm s/i of s unifies with t by their m.g.u. 0. 

The corresponding derived pair <p, q> has 

p - 6(r) 
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q - [0(s) with «<u) at i] 

Since s/i is linear, by Lemma 1 in the Appendix, substitutions for any two dis- 
tinct variables in t in 8 do not have a common variable. So, 9(s) and 6(u) are 
linear, and q is thus linear. 

(iii) if subterms of s do not unify with t, or s does not unify with subtems of 
t, then there are no derived pairs of r— >s and t— >u. D 

By a similar argument, it can also be proved that every derived pair of two left 
linear rules is left linear. 

The name "overlap closure" comes from the fact that the rules of OC(R) are 
a subset of the transitive closure of the rewriting relation of £: 

2.3. Lemma . If p -» q is in OC(R) then p -> q (using 1). 

Proof . By induction on the construction of p — > q in 0C(£). The basis of the 
induction is the case that p — > q is included in OC(g) by virtue of being a rule 
of R. Then obviously p -» q holds. If (p -» q) is included in OC(R) by being 
a derived pair of (r,s) and (t,u) then by the induction hypothesis for the two 
rules (r,s) and (t,u), we have r — > s and t — > u. By the definition of 
derived pair and the transitivity of — > , we then have p — > q. D 

2.4. Corollary . If OC(R) contains a reflexive rule, t -> t, then the rewriting 
relation of £ has a cycle. 

Proof . Immediate from the above lemma. D 

We would like to have the converse of this corollary, that if the rewriting 
relation of R has a cycle, then OC(R) contains a reflexive rule. This would 
permit searching for cycles by incrementally computing OC(R), looking for a re- 
flexive rule. While we have not been able to prove this in full generality, we 
will present in the next section a restricted version and its proof. The proof 
is not easy, because the overlap closure of £ is in general much smaller than 
the full transitive closure of R. It is this small size, relative to the tran- 
sitive closure, however, that makes it feasible to use the overlap closure as 
the basis of an approach to proving uniform termination or, at least, a useful 
notion of "restricted termination," discussed in [Guttag, Kapur, and Musser, 
1981]. 
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3. REWRITE DOMINOES AND THE fff tm pyyffl-ftp q/ffiTO 'Tff^F* 

In order to be able to prove the major result about the overlap closure, we 
need to be able to deal precisely with the various oases of overlap between suc- 
cessive applications of rewrite rules in a rewrite sequence. We have found it 
useful to introduce a new representation of rewriting that helps to aake such 
cases clear. 

The domino representation (or rewrite domino ) of a rewrite rule is a rec- 
tangle divided into left and right halves in which are inscribed tree represen- 
tations of the left and right terms of the rule. Function symbols in the terms 
are represented by labelled circles in the trees. Variable symbols are 
represented by labeled rectangles, called "variable boxes." For examples of some 
rules and their corresponding rewrite dominoes, see Figure 1. 

For each kind of domino (that is, each domino corresponding to a speoific 
rule), we assume there is an infinite stock of dominoes of that kind with their 
variable rectangles filled in with all possible terms. For each such domino, we 
also assume an infinite number of copies are available in the stock. 

A sequence of rewrites oan be represented by a domino layout , which is a 
two-dimensional arrangement of dominoes that obeys the rules of matching 
corresponding to those of term rewriting (Section 2). Before giving the formal 
definition of a layout, we refer the reader to an example of a rewrite sequence 
using the rules given in Figure 1 and its corresponding domino layout as shown 
in Figure 2. Another example is in Figure 3, and the two layouts in Figures 2 
and 3 could be concatenated to give a single longer layout. 

We draw trees oriented sideways with the root at the left, and we will use 
nested triangles to represent trees schematically. We define a unit layout from 
t to w to be a horizontal arrangement of a tree t, a domino with trees u and v, 
and another tree w, 
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Figure 1. A set of rewrite rules and their corresponding rewrite dominoes. 
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1. at some position, i, in t there is a subtree t' that is identical to u, ig- 
noring the variable boxes that appear in u; 

2. the roots of t' and n are horizontally aligned; 

3. v is the tree [t with v at i] and the roots of t and w are horizontally 
aligned. 

A layont from t to v is defined as 

1. a unit layout froa t to v; or 

2. the concatenation of a layout from t to u with a layout frost u to v, with 
both copies of u dropped fron the arrangement; or 

3. any arrangement obtained from a layout by translating horizontally any dom- 
ino, as long as no other domino or end tree is overlaid or crossed (this 
allows compaction of a layout by plaoing one domino above another when they 
match disjoint sub terms). 

The examples in Figures 2 and 3 illustrate a number of observations we can 
make about this representation of rewriting: 

1. In a domino layout there is no distinction between different orders of 
rewriting when the rules are being applied to disjoint subterms; e.g., the 
layout in Figure 3 would not be different if rule 5 had been applied before 
rule 4 or before rule 3. One can think of these rules being applied in 
parallel, since the order of application is always immaterial in this case. 
The layout representation just makes this property especially evident. 

2. To the property that "the rightmost term of a rewrite sequence is terminal" 
corresponds the property that "there is no way to play a domino on the lay- 
out" (formally, there is no way to concatenate a unit layout onto the lay- 
out). The layout is said to be blocked . (The layout in Figure 3 is 
blocked.) 

3. Thus the rules have the uniform termination property if and only if every 
possible layout eventually is blocked. Equivalently, there are no infinite 
layouts. 

Our purpose with this representation of rewriting is to provide a conceptu- 
al tool for finding and presenting proofs of new results about term rewriting 
systems. The first result we will prove with the aid of rewrite dominoes is one 
that will allow us to speed up the search for cycles by considering only those 
sequences of rewrites in which a "major rewrite" occurs. 

A rewrite t. -> t.. is called a major rewrite if it is by application of a 
rule, t -* u, to the entire term t.; i.e., for some substitution 0, 0(t) » t. 
and 8 (u) ■ tj. When only a proper snbterm of t. is matched, t_ — » t 1 is called 
a minor rewrite . 
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In a layout, a domino is called a major domino (of the layout) if it 
represents a major rewrite, and a minor domino otherwise. Pictorially, major 
dominoes are those that span the width of the layout. 

A ma j or cycle is a oycle in which at least one of the rewrites is major. 

3.1. Theorem . If a rewriting relation has a cycle, it has a major cycle. 

Proof . Let us define the corridor of a domino in a layout to be the horizontal 
strip across the layout determined by the position and width of the domino: 



Any two corridors in a layout are either disjoint or one is contained in 
the other. Therefore, we can find a corridor that is spanned by a domino and 
which contains a layout as follows: start with any leftmost domino and follow 
its corridor to the right; whenever a domino is encountered that doesn't lie in 
the corridor, adopt its corridor. When we reach the right end, we have a corri- 
dor containing a layout including a domino that is major with respect to it. If 
the whole layout ia cyclic, the identified layout will be also, and will 
represent a major cycle. Q 

We now want to define some terminology and some manipulations of layouts that 
will be useful in proving theorems about the overlap closure of a set of rules. 
Consider an adjacent pair of dominoes in a layout. Let t and u be the trees on 
the adjacent halves, where a subtree t' of t is identical to u (possibly 

t' - t): 




<a 



OR 



<0 




If either of t' or u is contained entirely within a variable box, i.e., the 
match is not between two nonvariable sub terms, we say that the pair of dominoes 
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is weakly matched , and otherwise that it is strongly matched . 
Examples . In Figure 3, the domino pair 




is weakly matched. Similarly the pair 
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that appears in the concatenation of the layouts of Figures 2 and 3 is weakly 
matched, while all the other adjacent pairs are strongly matched. 

Now suppose we have two weakly matched dominoes, as in Figure 4a, where t' 
is contained in the x variable box. If the (s,t) domino is right-linear (i.e., 
t is linear), then the pair of dominoes can be transposed as follows: remove the 
(u,v) domino from the layout and move the (s,t) domino to the right, so that 
copies of the (u,v) domino can be inserted to the left of the (s,t) domino, one 
adjacent to each x box in s (see Figure 4b). Then the resulting configuration 
is still a layout , (the dominoes all match, using the same set of rules) with 
the same end trees. This is the case also when a symmetric kind of transposi- 
tion is performed on the layout in Figure 5a, producing the layout in Figure 5b, 
where we assume that the (u,v) domino is left-linear. 



Such transpositions cannot necessarily be performed on strongly matched 
dominoes, but we will define a different kind of manipulation for this case. 
Strong matching corresponds to the concept of overlapping in the definition of 
derived pairs: if (r,s) and (t,u) are rules that have a derived pair <p,q>, 
then the dominoes corresponding to (r,s) and (t,u) can be placed in a layout so 
that they are strongly matched. The layout configuration shows just where the 
strong match ocours and identifies a potential derived pair. 
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Suppose now that instead of our stock of dominoes eorresponding to a given 
rule set R, we have a stock corresponding to OC(R), the overlap closure of £. 
Then for any strongly matched pair of doainoes in a layout there is a doaino in 
our stock which corresponds to a derived pair generated by the matching pair. 
By Leaaa 3 proved in the Appendix, we can replace the strongly Batched pair in 
the layout by the "derived pair doaino" thus identified, and the result will 
still be a layout with the saae end trees. 



We are now in a position to prove: 



3.2. Theorem . Suppose the rewriting relation of £ is globally finite and every 
rule in R is right-linear. If the rewriting relation of 1 has a cycle, OC(R) 
contains a reflexive rule. 

Proof . (By construction.) Let 



(*) 



*©"**!"* > *»■* *0 



be a given cycle. Corresponding to (*) is a cyclic doaino layout 




where the doainoes correspond to rules of R. In fact since each of these rules 
is also in OC(R), we aay take this layout as a layout of doainoes corresponding 
to rules of OC(R) . fe will show how to aanipulate this layout to a fora that 
shows there is a reflexive rule t -> t in OC(R) . 

We describe the manipulations as an algorithm operating on the cyclic lay- 
out (**). 

Step 1. [Extraot major cycle.] As in the proof of Theorem 5.1, extract from 
(**) a sublayout representing a major cycle, making it the layout subject to the 
following steps. Also replace t fl with its subterm matched by the layout. 

Step 2. [Push major domino s to right end.l Manipulate the layout to a form in 
which all of the major dominoes are together at the right end, by means of tran- 
spositions or replacements by derived pair dominoes: whenever D is a major domi- 
no and E is a minor domino adjacent to J) on the right 
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either D and E are weakly matched, in which case they can be transposed, or they 
are strongly matched, in which case they can be replaoed by the derived pair 
domino they define - which is a major domino. This derived pair domino is also 
right linear, as Lemma 1 in the Appendix shows. 

Step 3. [Look for cycle among major dominoes.] There is now a nonempty sequence 



of major dominoes D 1 



, . . . , 



D at the right end of the layout: 



<3 oD 



aa 



D, 



0<^ 



These dominoes can only be strongly matched - except for the case where the 
right-hand side of D £ is just a variable, but shortly we will show that such a 
possibility can be ruled out. If there is some contiguous subsequence 
D. ,... D. that forms a cyclic layout 



<3E]"-0<^l 



then, since there can only be strong matches, these dominoes can be combined by 
j - i + 1 replacements into a single domino j> that forms a cyclic layout: 



<3H<3 



Let D represent (p,q). Then there is a substitution 8 such that u - 6(p) and 
©(q) « u , i.e., e unifies p and q. Furthermore, a derived pair of (p,q) and 



(p,q) is the reflexive rule (O(p), O(q)). 
the algorithm. 



Since this is in OC(R), we terminate 



Step 4. [Duplicate.] If no such subsequence exists, construct a copy of the 
layout adjacent to it and return to Step 2 with the resulting layout: 
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That concludes the statement of the algorithm. Before considering the question 

of termination of the algorithm, we dispense with the detail mentioned in 

Step 3: the case of adjaoent major dominoes D and £ where the right term u of g 
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is just a variable. We can as sue the left tent t of £ is not just a variable 
(if it were then it would have to be the same variable as u and we would already 
have a reflexive rule). Since the layout is cyclic, if we drop fi from the lay- 
out, we obtain a layout that has as its right end t em a proper sub term identi- 
cal to the left end term, from this we conclude that the tens rewriting rela- 
tion is not globally finite, contrary to assumption. This contradiction rules 
out the case under discussion. 

It is obvious that each step of this algorithm is effective and terminat- 
ing. Overall termination is guaranteed by the following facts: 

a. At the k th execution of Step 2, the number of major dominoes, m, at the 
right end is at least 2 k . 

» 

b. Let t Q [kl denote the term to the left of J) in the layout at the k th exe- 
cution of Step 3. Sinoe eaoh t^Ek] is derived from t^ and the rewriting 
relation is globally finite, there are only finitely many distinct possi- 
bilities for t Q [k]. By a), then, there is one §uoh term for which arbi- 
trarily long layouts of major dominoes exist. Again by global finiteness, 
these layouts cannot all continue without producing a tern. u Q , that is a 
duplicate of some term previously obtained in the layout. 

Since the algorithm always terminates, and does so with a reflexive rule in 
OC(Jl), this proves the theorem. Q 

The corresponding theorem obtained by replacing "right-linear" by "left- 
linear" can also oe proved in a similar manner. Combining these theorems with 
Corollary 4.3, we have: 

3 - 3 - Th«P*f». Suppose the rewriting relation of 1 is globally finite and every 
rule in £ is right-linear or every rule in I is left-linear. Then the rewriting 
relation of R is uniformly terminating if and only if 0C(£) contains no reflex- 
ive rule. 

Some applications of this theorem are explored in [Gut tag, Kapur, and Musser, 
1981] 

Recently, Dershowitz (1981) has propossed a "forward chain" construction for 
rewriting systems and proved that a right-linear rewriting system is uniformly 
terminating if and only if it has no infinite forward chains. However, for 
left-linear systems the analogous result requires that the left-hand sides of 
the rules be nonoverlapping, a problem that we had independently encountered 
when considering the forward chain oonstruetion and a similar backward chain 
construction. We were thus led to invent the overlap closure construction. The 
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following example from Dershowitz (1981) illustrates the advantage of the over- 
lap closure construction over forward chains. Using the forward chain construc- 
tion, it is not possible to determine the nonteraimation of this left-linear 
rewrite system, as pointed out by Dershowitz. The rewriting system is 

f(a(), b(), x) -» f(x, x» b()) 
b() -» a(). 
These rules have only two forward chains, both finite: 

f(a(), b(),x) =» f(x, x, b()) =* f(x, x. i()), and b( ) $ a (), 

but we cannot conclude anything about the termination of the rules because they 
are not right-linear and, although they are left-linear, the left-hand sides are 
overlapping. But in the overlap closure construction, the rules have a derived 
pair rule 

f(b(), b(), x) -> f(x, x, b()), 
which, when overlapped with itself, gives the reflexive rule 

f(b(), b(), b()) -> f(b(), b<), b() )), 
as a derived pair, proving that the rules are non terminating. 
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4. CONCLUSION 

We have discussed two ways to Bake use of finite subsets of the overlap 
closure: proving restricted termination and disproving uniform termination. We 
have explored, without much success, using such finite subsets as parts of 
proofs of uniform termination. We conjecture that for certain classes of term 
rewriting systems it should be possible to oompute a bound, n, such that if a 
cycle exists, there exists a cycle in which every term is of size n or less. 
For such classes, the overlap cloture would provide a decision procedure for un- 
iform termination. 

Another open question about the generality of the overlap closure construc- 
tion is whether the assumption of left-linearity or right-linearity is neces- 
sary. Although we have not been able to find proofs of our results without this 
assumption, we have also been unable to construct a counterexample. In any 
case, as disoussed above, the overlap closure construction is more general than 
eitner forward or backward chain constructions. 

For the class of term rewriting systems to which it may be applied, con- 
structing the overlap closure is as useful as constructing the complete transi- 
tive closure. Furthermore, using the overlap closure to show restricted termi- 
nation or the absence of uniform termination will always involve computing fewer 
terms than would using the transitive closure. We do not yet have much empiri- 
cal or analytical evidenoe as to the absolute efficiency of using the overlap 
closure for these purposes. The key question is how many terms must be examined 
in order to demonstrate that no cycle is possible for terms of up to size n. 
The few examples we have tried, using a preliminary implementation, we have 
found encouraging. 

The basic idea of conducting a search for repeated terms (cycles) or sub- 
terms sprang from disoussions in 1977 between one of the authors (Husser) and 
Dallas Lankford. We thank P. Oloess, 6. Huet, and J. Levy for their interest 
and assistance in refuting some of our earlier conjectures, thus helping us ar- 
rive at the notion of the overlap closure and the theorems of Section 3. We 
also thank P. Narendran for assistance in constructing the proof of the theorems 
in the Appendix and J. Goguen for discussions of the approach to term rewriting 
used in OBJ. John Gut tag's research is supported in part by the National Science 
Foundation under grant MCS78-01798 and by an Office of Naval Research Contract 
with DARPA funding #N00014-75-C-06"61 . 
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1. Le— a . Let t and u be unifiable terns with disjoint variable sets, and be 

their most general unifier. Let 6* be the restriction of 6 to the variables of 

u, say 0* ■ [e,/v, ,...,e /v ] . If t is linear, then all variables in e.,,...,e 
linn in 

are distinct. 

Proof . For every variable x having k (>1) occurrences in u, replace different 
occurrences of x by distinct variables x , ...,x. that do not appear in t and u. 
Let n' be the resulting tens, which is linear. 

By Lena 2, in the n.g.u. 0' of t and u', substitutions for distinct variables 

in t and u' do not have a eoaaon variable. Let a be the m.g.u. for the set of 

x • 

terns 8'(x,), l<i£k, the substitutions for the variables used to replace nulti- 
ple occurrences of x in u. If these a for every variable x having nultiple oc- 
currences in u are conposed with 6', we get a unifier of t and u. 

In this unifier, substitutions for variables in u do not have a connon variable. 
Fron this, it is evident that the n.g.u. © of t and u cannot have substitutions 
for variables in u that share connon variables. Q 

2. Lenna . For two unifiable terns t and u, if t and u are linear, then the sub- 
stitutions in their n.g.u. 6 for any two distinct variables of t or u do not 
have connon variables. 

Proof . By induction on the structure in tern t. 

Basis: t is a variable. 

Then 0(t) - u and the statenent trivially holds. 

Inductive step: t — f(t, ,...,t ) 

l n 
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For t and u to be unifiablo, either u is a variable or u™ f(u, ,...,u ). The 

l n 

ease of u being a variable is handled as in the basis step. 

For the case u ■» f(u.,,...,u ), for each i, Ki<n, t. Must unify with u, by 

in i i 

their m.g.u. 9., say. By the inductive hypothesis, the statement holds for each 
of 0,. Since t and u are linear, the disjoint union of 9,, l£i£n, is the B.g.u. 
6 of t and u. It follows that the statement of the lemma holds for 9 also. D 



3. Lemma. Suppose t. — > t. using r — > s applied at position i, t- — > t. using 
t — > u applied at i.j, and s/j and t overlap determining the derived pair 
<p,q> - <9(r),[9(s) with 9(u)at j]>. Then t Q -> t, using p -» q applied at i. 
A similar result holds for the case in which s unifies with a subterm of t. 

Proof . Rename the variables of t and u, if necessary, so that s and t have no 
variable in common. There is some subterm t./i and a substitution 9- such that 
9 1 (r) = t Q /i and t ± - [t Q with 9As) at i]. 

Again, there is some subterm t ] /(i.j) and a substitution 9, such that 
9 2 (t) - tj/Ci.j) and t 2 - [t ± with 9j (u) at i.j]. 

Since the variables of s and t are disjoint, we have (9. U 9.) (s/j) = BAt/j) - 
9 2 (t) ■ (9 X U 9 2 )(t). That is. 9 X U 9 £ is a unifier of s/j and t and therefore 
has 9 as a factor: 



Sj D 8, « «, i 9, for some substitution 9.. 
Thus t Q /i - 9 1 (r) - (9 X U 9 2 ><r) - <9j e 9) (r) - 9 3 <9(r)) - 9 3 <p). That is, t Q 
is matched by p at i. Now consider 9,(q); it is 



9 3 ([9(s) with 9(u) at j]) 

- E9 3 (9(s)) with 9 3 (9(u)) at j] 
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= [e 1 (s) with e 2 (u) at j]. 

Thus t„ = [t with © 2 (u) at i.j] 

= [[t Q with G (s) at i] with ^U) at i.j] 

= [t Q with [6 1 (s) with 6 2 (u) at j] at i] 

= [t n with 9,(q) at i], showing that 
t — H- using p— >q applied at i. We omit the proof of the case in which s uni- 

fies with a subterm of t. □ 
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